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ABSTRACT 

We explore the cosmological constraints expected from wide area XMM-type cluster sur- 
veys covering 50-200 deg 2 , under realistic observing conditions. We perform a Fisher matrix 
analysis based on cluster number counts in combination with estimates of the 2-point cluster 
correlation function. The effect of the survey design is implemented through an observation- 
ally well tested cluster selection function. Special attention is given to the modeling of the 
shot noise and sample variance, which we estimate by applying our selection function to nu- 
merically simulated surveys. We then infer the constraints on the equation of state of the dark 
energy considering various survey configurations. We quantitatively investigate the respective 
impact of the cluster mass measurements, of the correlation function and of the 1 < z < 2 
cluster population. We show that, with some 20 Ms XMM observing time, it is possible to 
constrain the dark energy parameters at a level which is comparable to that expected from the 
next generation of cosmic probes. Such a survey has also the power to provide unique insights 
into the physics of high redshift clusters and AGN properties. 

Key words: cosmology: observations - cosmology: theory - clusters: general - cosmological 
parameters 
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1 INTRODUCTION 

The statistical properties of galaxy clusters provide independent 
cosmological information, complementary to that inferred from 
other observations such as measurements of the Cosmic Microwave 
Background (CMB), Supernova Type la (SN la), Baryon Acoustic 
Oscillations (BAO) and weak lensing (WL) data. Clusters are the 
largest virialized objects (dark matter halos) in the Universe, with 
mass scales corresponding to overdensities that enter the non-linear 
phase of gravitational collapse between redshifts < z < 3. Con- 
sequently, their abundance and spatial distribution can potentially 
probe both the cosmic expansion history as well as the growth of 
cosmic structures. Theoretical considerations such as the predic- 
tion of the halo mass func t ion based on semi -analytical approaches 
jPress & SchechteJll974l: iBond et al J 1 1 99 lh and N-body simula- 
tions (see e.g. lShefh & Tormenlll999l) have suggested that cluster 
statistics is particularly sensitive to the normalization of the mat- 
ter power spectrum as (the root-mean-square of linear fluctuations 
within a sphere of 8h~ Mpc radius) and the total cosmic matter 
density S7 m . These observational aspects have given a strong incen- 
tive to the use of clusters as cosmic probes. 

Over the past decades cluster observations have greatly 
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evolved. After the pioneering studie s of the Einstein Medium 
Sensitivity Survey dGioia et all I"l990). the Rosat All-sky survey 
(RASS) and deep ROSAT pointed observations have provided an 
invaluable reservoir of clusters out to redshift ~ 1. These mea- 
surements enabled the first deter minations of ag and fl m based on 
cluster number counts alone (seelEvrardll989llOukbir & Blanchardl 
1 1 992l: IWhite et al J 1 1 9931: 1 Viana & Liddlel 1 1 9961: lEke et al J 1 1 99a 
Henrvl Il997l l200d : iBorgani et al.l l200ll : Ivikhlinin et alj I2OO3I: 
I Allen et al.l 120031) and in combination wi th measurements of th e 
local correlation function from RASS dSchuecker et alj 120031) . 
Similarly the Sloan Digital Sky Survey cluster cata logue offered 
the fi rst determination using an optical dataset (e.g. iBahcall et al.l 
2003). Quite remarkably these measurements have always con- 
sistently pointed out to a low matter densit y universe, in agree- 
ment with results fr om galaxy survey data jPercival et al. 200 ll: 
[ Tegmark et al.ll2004l) and CMB observations dDe Bernardis et al.l 
I2OOOI: ISpergel et aljl2003l) . With the launch of XMM and Chan- 
dra a decade ago, a new era has begun: deep pointed observations 
of large cluster samples, mainly extracted from the ROSAT cata- 
logues, have provided detailed insights into the baryonic physics 
of clusters and their morphology. This has resulted in a tremen- 
dous burst in the modeling of the cluster properties as well as in the 
determination of their mass. These advancements have led to im- 
proved constraints on erg and Sl m , as obtained for example using the 
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temperature function of local bright clusters dHenrvetalj|2009h . 
Fro m the point of view of large area surveys, the XMM-LSS sur- 
vey dPierre et alJl2004) covering some 1 1 deg 2 performed pioneer- 
ing cluster detection work, assembling a complete sample of XMM 
clusters at a sensitivity of ~ 1CP 14 erg cm~ 2 s _1 in the [0.5-2] 
keV band. Moreover, it provided detailed insights abo ut the impact 
of se lection effects on cluster evolutionary studies jPacaud et al.l 
l2007b . 

The discovery of dark energy has generated a revived interest 
in the use of cluster statistics as an alternative test for probing the 
nature of this exotic component. Dark energy can directly affect the 
cluster number counts by modifying the growth rate of structures 
as well as the size of the cosmological volume probed at a given 
redshift dWang & Steinhardlll998l ; lHaiman, Mohr & Holdej200lt 
iHuterer & Turnedl200ll) . Several works have attempted to measure 
the dark energy equati on of state u s ing cluster data in combina- 
tion with other probes dHenry ||2004; Mantz 2008; Vi khlinin et al.l 
l2009l : lAllen et al. ll2008l ; lRozo et al.ll2009l) . However statistical and 
systematic uncertainties, as well as the presence of degeneracy be- 
tween cosmological parameters, remain the major limitations to ac- 
curately test dark energy with current data. 

From an observational point of view, the main quantities that 
are useful to constrain cosmology are: the redshift evolution of the 
cluster number counts (dn/dz) or ideally the evolution of the clus- 
ter mass function (dn/dMdz), the spatial distribution of clusters 
(e.g. the two-point correlation function, f), the cluster temperature 
function, the gas mass fraction in clusters as well as various scal- 
ing laws describing the evolution of cluster structural properties. 
There are two key practical issues that such studies have to face: 
firstly the ability to assemble well characterized cluster samples, 
and secondly the need for well understood mass-observable rela- 
tions, since for a given cosmology the cluster mass is the only in- 
dependent variable entering the theory. Mass estimates can be in- 
ferred from a variety of methods: optical richness, galaxy velocity 
dispersion, X-ray luminosity or temperature, S-Z decrement, weak 
lensing signal or from more elabo rated proxies such as Tx x M gaa 
described in dKravtsov et alj|200^) : if X-ray temperature and gas 
density profiles are available, masses can be calculated under the 
hypothesis of hydrostatic equilibrium. 

Depending on the number of cosmological parameters that 
one aims at constraining and the required accuracy, the minimum 
size of useful cluster samples ranges from 50-100 objects for con- 
straining as and fi m only, to several hundreds or even several thou- 
sands (if little information is available on masses) for constraining 
the dark energy parameters. 

Since clusters constrain regions of the cosmological parame- 
ter space which are complementary to that pro bed by other tests 
such as SN la, CMB, BAO and WL data (see e.g. lHuterer & Turned 
1200 lh . considerable efforts have been devoted, both theoreti- 
cally and observationally, to characterize the use of clusters in 
the near future. Forecasts of the dark energy parameter uncer- 
tainties from future optical, X-r ay and S-Z surveys have been 
the subject of sever a l analyses (Weller, Battye & Kneissl 2002; 
Hu & Kravtsovll2003l:lMaiumdar & MoMl2003l . l2004l ; IWang etal] 
2004; Wu. Rozo & Wechslen 120081) . These studies, generally fo- 



cusing on surveys covering a few 1 000 deg , have shown that 
precision cosmology in the context of cluster surveys is certainly 
possible in the near future. Subsequently, there has been a grow- 
ing interest in evaluating the impact of systematic uncertainties of 
such cluster surveys. For instance, one can mention the sensitiv- 
ity of the dark energy c onstraints to halo modeling uncertainties 
dCunha & Evrard 2009) or to the mass accuracy of given cluster 



sub-samples; the latter is of special relevance when designing the 
follow-up observations to increase the cluster mass accuracy: given 
that telescope time is limited, it is neces sary to optimise the target- 
ing of specific mass and redshift ranges dWu et alj|20ld) . 

While these prospective dark energy studies pertain to upcom- 
ing or future instrumentation, we examine here the potential of 
XMM, whose characteristics and capabilities are now very well es- 
tablished. In fact, with its outstanding collecting area (~ 2000 cm 2 
on axis at 1 keV), wide spectral range ([0.1-10] keV), good spa- 
tial (~ 6 arcsec on axis) and spectral (5-10% at 1 keV) resolution, 
XMM appears to be the best suited, currently available, X-ray ob- 
servatory to undertake a large cluster survey. As an example, with 
10 ks exposures, XMM reaches a sensitivity which is about 1000 
times greater than RASS, i.e. 5 x 10 _15 erg cm -2 s _1 in [0.5-2] 
keV for point sources. Basically, XMM has the power to unambigu- 
ously resolve any cluste£] provided that at least some 100 photons 
are collected. 

In this paper, we forecast the dark energy parameter errors 
for an XMM cluster survey with an area of the order of 100 deg 2 . 
Using results from accurate survey simulations and precise model 
predictions, we estimate the dark energy parameter errors for dif- 
ferent survey configurations. We find that the expected parameter 
constraints are not only complementary to those of other cosmo- 
logical probes, but competitive with respect to forecasted errors for 
the next generation of dark energy dedicated experiments. 

Compared with other cluster surveys, X-ray observations 
have an indisputable advantage, since cluster X-ray properties 
can be predicted ab initio for a given cosmological model, with 
observational input (e.g., mass-observable relations) being easily 
implementable. In contrast, ground-based large optical cluste r 
surveys (e.g., SDSS Max BCG catalogue, iKoester et ail 120071) . 
though may appear much more attractive because of their lower 
cost, still require ad hoc prescriptions to evaluate the cluster 
selection function with cosmological numerical simulations. 
Such procedures usually rely on the optical richness as defined 
by the galaxy distribution. We want to stress that computing a 
cluster survey selection function in the era of precision cosmology 
requires a self-consistent modeling of the selection function itself. 
We will show here that this plays a critical role in the interpretation 
of the cluster number counts. It is also worth mentioning that, 
after 40 years of experience, X-ray cluster surveys are still much 
ahead of S-Z surveys both in terms of detection rates and for the 
evaluation of the selection function. In the following we shall refer 
to the discussed survey as the XXL survey. 

The paper is organized as follows. In Section [2] we introduce 
the basic equations for the cluster survey observables, namely the 
cluster number counts and the 2-point correlation function. In Sec- 
tion [3] we describe the survey configurations and selection func- 
tions, while in Section|4]using numerical simulations we estimate 
the expected experimental survey uncertainties. In Section[5]we de- 
scribe the Fisher matrix calculation performed to infer the expected 
cosmological parameter constraints, and discuss the results in Sec- 
tion [6] Finally we present our summary and conclusions in Sec- 
tion [7] Throughout the paper, we consider th e ACDM cosmology 
with the parameters determined by WMAP-5 ( Dunkle v'et alj2009l) 
as our fiducial cosmological model. 



1 A core radius of 150 kpc corresponds to an apparent diameter of 35 arcsec 
at z = 2, to be compared to the XMM on-axis PSF of 6 arcsec 
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2 CLUSTER SURVEY OBSERVABLES 

The number of clusters as function of redshift is given by 



dn . _ d 2 V , . 



where AO, is the survey solid angle, d 2 V/ dfldz is the cosmological 
volume factor, F S (M, z) is the redshift dependent survey selection 
function and dn/d log M is the comoving density of halos of mass 
M. 

The volume factor in a Friedmann-Robertson- Walker space- 
time reads as 



d 2 V 



c (l + z) 2 dl(z) 



(2) 



dttdz H E(z) 
with c the speed of light, Ho the Hubble constant today, and 

e(z) = V^ m (i+z) 3 + a(i+ z ) 4 + fi DE /DE(V) + n k (i+ z ) 2 , 

(3) 

where Q m , f2 r , Ode, Oj, are the present matter, radiation, dark en- 
ergy, and curvature densities, in units of the critical density respec- 
tively, and d&(z) is the angular diameter distance. The function 
Ide{z) depends on the model of dark energy. We consider three 
scenarios: 1) cosmological constant A, with Ia(z) = 1; 2) dark en- 
ergy fluid characterized by a constant equation of state w for which 
7de(z) = (1 + z) 3 ( 1+w ^; 3) time evolving dark energy equation 
of state parametrized in the f orm woe(z) = wo + w a z/(l + z) 
dChevallier&Po larski 200lL iLinded l2003h for which Jde(z) = 



(l + z 



3(l + w +w a ) 



exp [—3w a z/(l + z)]. 



Cluster DE studies make various assumptions as to the selec- 
tion function. It can be defined by a simple mass limit, depending or 
not on redshift and cosmology; the limit is supposed to be step-like 
or to allow for a possible di spersion and for some smooth func- 
tion across the threshold (e.g.lLima & Hvfeoojl ; iHu & Cohrj|2006l ; 
lAlbrecht et al. 1120061 : iBasilakos et alJl20ld) ^For an X-ray survey, 
M\ijn(z) is determined using a mass-observable relation, for in- 
stance the mass-luminosity relation. Because of this, the limiting 
mass depends on empirically determined parameters (parametriz- 
ing the physics of the hot gas in clusters), and the luminosity dis- 
tance, i.e. on the underlying cosmological model. This is an impor- 
tant point, since it implies that to properly infer cosmological con- 
straints from cluster data one must take into account the cosmolog- 
ical dependence of the mass threshold Mii m (a). Another relevant 
aspect concerns the fact that survey design, instrumental character- 
istics and imaging technique analysis, cause the cluster selection 
function not to behave as a step function. This can affect the pre- 
dicted number counts and 2-point correlation statistics for a given 
cosmological model. The effect of a non ste p-like function has 
been studied in th e literature for S-Z surve ys (Holde r et alj|200fj : 
iLin & Mohj2003h and X-ray surveys JSahlen et alj2009h . Here we 
use realistic selection functions derived from accurate simulations 
of the XMM-LSS survey that will be discussed in Section[3] 

The comoving density of halos of mass M at redshift z reads 

as 



dn(M, z) 
d log M 



p m rilogq- 
M d log M 



(4) 



with p m the present mean matter density, <j(M, z) the root-mean- 
square fluctuation of the linear density contrast smoothed on a scale 
R = (3M/47r/O m ) , and /(<r, z) the multip licity function. Her e 
we adopt for f(a, z) the modeling proposed bv lTinker et alj (l2008). 
Our working assumptions are detailed in Appendix [A] 
The variance of the linear density contrast smoothed on scale R at 



redshift z is given by 

a 2 {R,z) = A 



2, n , 1 2 I dk „ _>-,,.' , , ,u--, 
2tt 2 



T k n " +2 T 2 (k,z)W 2 (kR), (5) 



where A is a normalization constant fixed so that today a(R = 
= (J8, n s is the scalar spectral index, T(k, z) is the 
linear matter transfer function and W(kR) is the Fourier trans- 
form of the real space top-hat window function. We compute 
the matter transfer fun ction using the fitting formula provided by 
lEisenstein&Huld 19981) . which includes the wave pattern imprinted 
by the baryon acoustic oscillations. 

On large scales the 2-point spatial correlation function for a cluster 
survey covering the redshift range [z m in, Zmax] is given by 



i::::^{z)dz ■ 



where 



(6) 



(7) 



and £(R,z) = b 2 s (z)£n n (R, z), with £y ln (R, z) the Fourier 
transform of the linear matter power spectrum at redshift z. The 
evolution of the line ar bias averaged over all halos reads as 
( Matar rese et al J 19971) 

b aS {z) = -L f°° Fs ( M , z)b(M, z) d ^ M \f d log M, (8) 
n(z) Jo dlogM 

where b(M, z) is the linear bias relating dark matter halos of mass 
M to the mass density fluctua tion. We assume the bias model in- 
troduced in lTinker et al. (2010), 



b(M,z) = 1 - + 0.183 ( - 



B b 



, (9) 



with S c = 1.686 the critical linear overdensity given by the spheri- 
cal collapse model, and the fitting parameters given by 

A = 0.24ye X p[-(4/y) 4 ], (10) 
a b = 0.44(j/-2), (11) 
B b = 0.019 + 0.107y + 0.19exp[-(4/y) 4 ], (12) 

where y — log 10 (A m ) with A m the nonlinear overdensity thresh- 
old. Our fixed value of S c is only exact for an Einstein-de Sitter 
universe - although it hardly varies with the c osmology. Neverthe- 
less, we preferred to follow the convention of iTinker et alj d2010l) 
and fix it. 



3 XXL SURVEY CHARACTERISTICS 

Cluster surveys are, similarly to galaxy surveys, defined by a num- 
ber of parameters such as sky coverage and geometry, depth, selec- 
tion function, and redshift accuracy. On the other hand, compared 
to galaxies, clusters are rare objects, a characteristic that has a sig- 
nificant impact on the determination of the correlation function. 
Moreover, as already mentioned in the introduction, cluster mass 
accuracy plays an important role in the determination of the cos- 
mological parameters. In this section, we present the generic char- 
acteristics of the XXL survey, while a quantitative examination of 
the various sources of uncertainty will be presented in Section|4] 
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3.1 Two survey designs 

In this case study, we examine the merits of two possible XMM 
survey concepts: Survey-A covers a total sky area of 50 deg 2 with 
40 ks XMM pointings, this configuration is assumed to allow mass 
measurements at the 10-50% level for the selected cluster samples; 
Survey-B covers 200 deg 2 with 10 ks XMM pointings and provides 
a cluster mass accuracy of 50-80%. Possible survey configurations, 
resulting from various splitting in sub-regions, are summarized in 
Table lBll and discussed in Section l4~2l 

To give an order of magnitude of the observing time neces- 
sary to perform these surveys, one can imagine mosaics consisting 
of XMM observations whose center are separated by 20 arcmin in 
RA and Dec, so that 9 observations are necessary to cover ldeg 2 . 
Consequently, both surveys A and B correspond approximately to 
~ 18 Ms net observing time, i.e. some 180 XMM (2-day) revolu- 
tions, allowing for 10 observations per revolution with the mosaic 
mode. 

3.2 Modeling the cluster population as seen by XMM 

Before detailing the survey selection function, we need to specify 
how the two basic observable quantities, the X-ray count rate in a 
given band and the apparent size of the cluster sources, relate to 
the cluster mass as a function of redshift. 

In the following, we assume the usual [0.5-2] keV range as 
the working detection band, since it presents the optimal S/N, given 
the cluster sp ectra, the bac kground spectrum and the XMM spec- 
tral response dSchadl2002h . Furthermore we assume the observed 
cluster scaling laws between luminosity (L), temperature (T) and 
the mass within a radius containing an overdensity of 200 times 
the critical density (Mg o pe) as determined in the local universe 
dArnaud & Evrardl 1 19991 ; lArnaud et all l2005h . and use the self- 
similar prescription for their evolution. To account for the scatter 
observed in cluster properties, we encapsulate the dispersion of the 
M — T and L — T relations in the M — L rela tion, for simplic- 
ity. Following the analysis bv lStanek et al.l d2006t) . who measured 
g\xxM\l = 0.37, we use (Tin £ | ^ ~ 0.37x 1.59 ~ 0.6, where 1.59 
is the slope of their M — L relation. To assign the X-ray luminosity 
we assume a log-normal distribution. These prescriptions allow us 
to compute the flux, and finally the count rate as function of the 
cluster mass and redshift. The impact of these hypotheses will be 
discussed in Sec.|6]and|7] 

Fluxes are estimated using the APEC thermal plasma model, 
assuming a fixed hydrogen column density of 2.6 x 10 20 cm -2 and 
setting the heavy element abundance to 0.3 solar. Fluxes are sub- 
sequently folded with the telescope and detector response (EPIC 
response matrices) assuming the THIN optical blocking filter. This 
allows us to predict the observed count-rates. We further assume a 
/3-profile for the gas distribution, with /3=2/3 and a constant physi- 
cal core radius of 180 kpc, unless otherwise specified. This finally 
yields the spatial distribution of the cluster counts on the detectors. 

3.3 The cluster selection function 

We now turn to the description of the selection function. The 
ability to select clusters upon well-defined X-ray criteria is a 
key issue: as shown in Section [2] the selection function directly 
enters into the modeling of the cluster number counts and spatial 
correlation function. 




0.04 0.06 0.08 0.10 

M1+M2 + PN count-rate (cts/s) 



Figure 1. The CI cluster selection function derived from extensive simula- 
tions: the probability of cluster detection is expressed in the count rate (~ 
flux ) - core radius plane. A /3-model with /3 = 2/3 is assumed. 



In this prospective study, we adopt the C1/C2 selection 
functions specifically determined for the XMM-LSS survey. 
These have been extensively tes ted on the basis of XMM image 
simulati ons jPacaud et alj l2006h and applied to the XMM-LSS 
sample jPacaud et alj 120071) . The selection basically operates in 
the [extent, extent likelihood] X-ray pipeline parameter spacfl 
where extent is taken to be the core radius of the /3-model. The 
procedure allows us to assemble samples of extended X-ray 
sources that have a well-defined degree of contamination by 
miss-classified point-source; these can be easily discarded a 
posteriori by examining the X-ray/optical overlays. We define 
two samples, CI and C2, for which the con tamination is ~ 
and ~ 50% respectively JPierre et alj 120061) . This procedure, 
which operates in a two-dimensional parameter space, enables 
the construction of uncontaminated cluster samples significantly 
larger than those obtained by a simple flux limit. The selection 
criteria are subsequently converted into the probability of detecting 
a source characterized by a given core radius and flux. The CI 
selection probability function is displayed in Fig. [TJ Using the 
cluster model described in the previous paragraph, we derive the 
limiting cluster mass detectable as a function of redshift for CI and 
C2 respectively. Since the current C1/C2 selection criteria have 
been defined for 10 ks XMM exposures, the resulting selection 
corresponds to clusters having A'hooc > 2 x 10 14 M@, thus 
relatively massive objects as can be seen in Fig. [2] Moreover 
we note that Afi im (Cl) ~ 1.5 x Mi im (C2) for z > 0.2, with 
the C2 selection yielding about twice as many clusters as the CI 
selection. Notice that the CI sample is always a sub-sample of 
the C2 selection. The number of collected cluster counts at the 
detection limit is displayed in Fig. [3] 

Practically, our cosmological analysis will be performed in 
two stages, (i) In a first step, we consider the same cluster selec- 
tion functions independently of the survey configuration (A or B). 
This means that for configuration A, the sample is defined from 



2 Because of the limited number of source photons, the pipeline operates 
in Cash statistics and returns, for each source parameter, the likelihood of 
the measurement 
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Figure 2. The limiting detectable cluster mass as a function of redshift. A 
detection probability of 80% is assumed. Masses are expressed in terms of 
A^200c i the mass within a radius containing an overdensity 200 times the 
critical density. 




Figure 3. Number counts (2 MOS + pn) collected in 40 ks (Survey-A con- 
figuration) from a CI and C2 cluster in the [0.2-8] keV energy range, as 
a function redshift. A detection probability of 80% is assumed, thus corre- 
sponding to the My lm (z) of Fig. [2] The EPIC sensitivity has been averaged 
over the inner r = 10 arcmin (mean vignetting of 0.69). Assuming that 
half of the collected photons are used for the spectral analysis, our selection 
ensures that at least 500 counts are available for temperature determination 
with 40 ks XMM exposures. 



Table 1. Properties of the cluster samples selected for the cosmological 
analysis 
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C20 
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23.2 


28.2 
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20% detection probability 

50% detection probability 

80% detection probability 



Figure 4. Same as Fig.|2]for the C20 population 



iFinoguenov et alj d2007l) . The characteristics of the C20 clusters 
are displayed in Figs. [4] and [5] The number densities of the CI, C2, 
C20 populations are given in Table [T] Furthermore, we define the 
following sub-classes: we refer to C2' for C2 clusters not detected 
as CI and, similarly, to C20' for the C20 clusters not detected as 
C2. 



4 ESTIMATING MEASUREMENT UNCERTAINTIES 

We provide in this section a detailed account of the uncertainties 
pertaining to the measurements of clusters masses, cluster number 
counts and 2-point correlation function, as expected from the XXL 



sub-exposures of 10 ks. The main goal of the total 40 ks integra- 
tion time is to reach the X-ray spectral accuracy enabling accurate 
mass measurements. Further, at the full depth of 40 ks, Survey- 
A enables the detection of deeper cluster samples. Consequently 
(ii) in a second step, we investigate the added cosmological value 
from clusters only detected in the 40 ks observations of Survey-A. 
We thus define a C20 class, a scaled-down version of the C2 pop- 
ulation detected in 10 ks. Since the C2 selection function is well 
depicted by a detection probability as a function of S/N, we simply 
derived the C20 detection efficiency by extrapolating the results 
of Pacaud et al. (2006) to 40ks, scaling up the source S/n3 The 
density inferred for this population is on the order of 30/deg and 
comparable to that inventoried in the ~ 50ks COSMOS field by 



20% detection probability 
50% detection probability 
80% detection probability 



A This method was already applied in Pacaud et al. (2007) to account for 
the spatial variations of exposure time. 



Figure 5. Same as Fig.[5]for the C20 population 
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Table 2. Adopted mass precision for each individual cluster as a function 
of XMM exposure time. The numbers are the 1-ct errors on ln(M). The 
* indicates that this sub-population does not provide mass information for 
the Fisher analysis. Last line gives the assumed precision on the luminosity 
measurements. 



Selection 


Adopted mass accuracy 






Optimistic view 


Pessimistic view 




10 ks 


40 ks 


10 ks 


40 ks 


CI 


0.5 


0.1 


0.8 


0.5 


C2' 


0.8 


0.5 


* 


0.8 


C20' 


not detected 


0.8 


not detected 


* 


a lnL obs 


negligible 




0.2 





survey depth and geometry are quite different from that of galaxy 
or weak lensing surveys. More precisely, considering splitting the 
survey in several sub-regions (a strategy favored by practical ob- 
serving considerations), we need to estimate the trade-off between 
averaging the sample variance and the loss of S/N in the 2-pt 
correlation function at large distances. In principle, it is possible 
to analytically calculate the sample variance and the shot noise for 
dn/dz and £ as a function o f cosmology for a give n flux limited or 
volume limited survey (e.g. lHu & Kravtsovll2003l) . Having here a 
well defined selection function Mn m (z), we perform an "in situ" 
and global estimate using numerically simulated cluster samples. 
The corresponding calculations are detailed in Appendix [B] 



survey. These are the necessary ingredients for a realistic evaluation 
of the cosmological parameter errors via a Fisher matrix analysis. 



4.1 Accuracy of the cluster mass and redshift measurements 

For this study, we do not rely on the, so far non-observationally 
validated, self-calibration techniques that allo w for some universal 
redsh ift-dependent mass-observable relation iMaiumdar & MohH 
2004); we discuss the relevance of this option in Sec. [7] We rather 
attribute to each cluster a mass accuracy as a function of its X-ray 
flux. The limiting collected counts for the three cluster populations, 
as indicated by Figs. [3] and [5] allow us to estimate the mass accu- 
racy reachable for each selection. We consider a pessimistic and an 
optimistic situation and further set a limit on the precision of the ob- 
served luminosities. These working hypotheses are listed in Table 
[2] It is not the purpose of the present article to discuss in detail how 
such mass accuracy will be obtained, but one can foresee a set of 
realistic observations leading to the desired precision. For instance, 
configuration B is similar to the well studied XMM-LSS design, i.e. 
a mosaic of 10 ks exposures, which allowed mass measurements to 
better than 50% for th e CI population, und er the assumption of hy- 
drostatic equilibrium ( Pacaud et al. 2007). Improved cluster mass 
accuracy will be attained with the ad dition of weak lensin g and 
Sunyaev-Zel'dovich observations (e.g. Mah davi et alj 12007)). The 
use of several X-ray mass proxies, such as the Y% = T X M gaa 
param eter, can also greatly e nhance the precision of the mass esti- 
mates JVikhlinin et all2009l) . 

Experience with the Canada France Hawaii Telescope Legacy 
Survey (CFHTLsQ) showed that cluster photometric redshifts can 
be obtained for the C 1 and most of the C2 clusters at an accuracy 
of ~ 0.01 — 0.02 from a 5-band survey in the optical (Maz ureet al.l 
120071) . Further, with the up-coming generation of wide-field spec- 
troscopy instruments (e.g. refurbished VIMOS and forthcoming 
KMOS at the ESO Very Large Telescope) gathering redshifts of 
clusters with a density of ~ 10 — 50/deg 2 over an area of 100 
deg 2 will be easily achievable within the next decade. 



4.2 Statistical significance of dn/dz and £ 

Evaluating the impact of the survey size on the statistical sig- 
nificance of dn/dz and £ from cluster surveys deserve special 
attention. Because clusters are rare objects, the relative effects of 
shot noise, sample variance and edge effects as functions of the 



http://www.cfht.hawaii.edu/Science/CFHLS/ 



5 FISHER MATRIX ANALYSIS 

We perform a Fisher matrix analysis to quantitatively estimate 
the cosmological information that can be extracted from the two 
XMM-survey configurations (A and B). 



5.1 Method 

Here we briefly sketch the basic principle of the Fisher matrix ap- 
proach, interested readers may find more exhaustive discussions 
on it s cosmological applications in (Tegmark. Taylor & Heavens! 
ll997l : lEisenstein. Hu & Tegmarkll 19991) . 

Let us consider a set of measurements Di — {Di, Dn} 
(for simplicity let us assume them to be uncorrelated), from which 
we want to derive constraints on a set of parameters # M = 
{6i, ...,9m} in a given model AL We firstly evaluate the like- 
lihood function, L(Di\8 ll , M), and assuming a prior probability 
distribution for the model parameters, P(# M |.M), we construct 
using Bayes' theorem the posterior probability, i.e. the probabil- 
ity of the parameters given the observed data, P(8 l± \Di, M) oc 
L(Di\0^, A4)P(8 I± \A4). The posterior contains all statistical in- 
formation from which we derive the "confidence" intervals on the 
parameters M , Now, let us indicate with 0;(#m) the model predic- 
tion of the observable to be confronted with the data Di, and let <t; 
be the experimental uncertainties. Assuming Gaussian distributed 
errors, we can write up to an additive constant the log-likelihood as 
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If Ofj, are the model parameter values that maximize the likelihood, 
then we can expand Eq. J 1 3fc to second order in 58^ — 0^ — 0^ and 
obtain 
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This leads to the Fisher matrix given b}0 
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The parameter uncertainties as well as their mutual correlations 
are encoded in the covariance matrix, C M „ = F~ u , where the lcr 



5 Although we have assumed a Gaussian likelihood to derive this expres- 
sion, it is worth noting that the Fisher matrix has exactly the same shape for 
Poisson statistics. 



7 



model parameter errors are simply the square-root of the diagonal 
elements, erg = ^/C MM . These are the marginalized errors, in the 
sense that if we consider a specific parameter, e.g. 9\, then the un- 
certainty crg 1 obtained by inverting the full Fisher matrix is equiva- 
lent to that obtained by integrating the likelihood function over the 
M — 1 parameters, thus accounting for all possible parameter cor- 
relations. External priors on a given parameter can be easily imple- 
mented, e.g. suppose we want to include a cr$ 3 = 0.01 prior on the 
parameter 9g, in such a case it is sufficient to add to Eq. J X 5 b a ma- 
trix Ppu whose only non- vanishing element is P33 = 1 /og . Simi- 
larly information from other datasets can be easily implemented by 
adding the corresponding Fisher matrices. 

Using Eq. l |15l > greatly simplifies the estimation of the cos- 
mological parameter uncertainties for a given experiment. Then 
forecasting parameter errors reduces to knowing the expected ex- 
perimental/observational uncertainties (cr^), assuming a fiducial 
cosmology and computing the Fisher matrix by evaluating 
the derivative of the observable at the fiducial parameter values 
(dO/dOfilg ). The inferred errors will necessarily depend on the 
fiducial cosmology assumed; this is the case even if one runs a 
full numerical likelihood analysis over a set of randomly gener- 
ated data. Henceforth the results of this type of analysis should 
not be used for estimating the performance of experiments in 
distinguishing between different models. We refer the reader to 
( Mukh eriee etai~ll2006h for a discussion on the limitation of this 
approach in model selection problems and the solution in the con- 
text of Bayesian statistics. 

We evaluate the derivatives of the observable with the respect 
to the model parameters using the five-point stencil approximation: 

dO _ 2 Q(fl M + 56^ - Q(g M - 89^) 
<96» M ~ 3 se„ 

, Q(8 ll - 286 ll )-0(9 ll +266 ll ) 

+ m% (16) 

with steps 59 fi of order 5% on the fiducial parameter value. 

Our survey observables consist of the cluster number counts 
dn/dz given by Eq. Q} in redshift bins of size Az = 0.1 and the 
two-point spatial correlation function £(R) given by Eq. ((6}. For 
the cluster counts we consider detections in 10 or 20 equally spaced 
redshift bins in the range 0<z<lor0<z<2, while for the 
correlation function we consider the 10 < 7? (/i _1 Mpc) < 40 
scales. For each selection function we derive the expected survey 
uncertainties Oi on dn/dz and £ using the S/N calculated from 
the simulations, described in Appendix [B] These account for the 
integrated effect of the Poisson noise and sample variance. 

5.2 Fiducial cosmology and model parameters 

We assume as our fiducial cos mology a flat ACDM model best- 
fitting the WMAP-5 years data (Dun klev et al.ll2009h . specified by 
the following parameter values: Q, m h 2 — 0.1326, Q,\>h 2 = 0.0227, 
h = 0.719, n s = 0.963, cr 8 = 0.796, r = 0.087. For this model 
the expected number of clusters as function of redshift for Survey- 
A (50deg 2 ) is shown in Fig. [6] for the three selection functions. 
Fig. I7J displays the 2-point cluster correlation function. Here it is 
worth noticing that while the three functions have the same shape, 
the C2 curve has a slightly lower amplitude than CI, and higher 
than C20, consistently with the mass ranges pertaining to these 
samples (less massive objects are less clustered). 

We derive constraints on the following set of parameters: 
h, n 3 , as (ACDM), including a varying equation of state 
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Figure 6. Redshift distribution of the CI, C2 and C20 populations for the 
ACDM fiducial cosmology in the Survey-A configuration. 
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Figure 7. Two-point correlation function of the CI, C2 and C20 popula- 
tions. 

w(z) = wp + w a z/(l + z) with p arameters Wo and w a 
jChevallier & Polarskill200ll : lLindeill2003l) for w(z)CDM models. 

5.3 Modeling cluster mass uncertainties in the Fisher 
analysis 

For the Fisher analysis, our aim is to reproduce as much as pos- 
sible the observational procedure and the subsequent cosmolog- 
ical analysis. To summarize the steps: (1) clusters are selected 
in the XMM images according to a two-dimensional parameter 
space; (2) corresponding dn/dz and £ are derived; (3) each cluster 
mass is measured at a given accuracy - the mass measurements be- 
ing cosmology-dependent; (4) for a given cosmology, we compute 
dn/dz and f, the observational selection function being yet trans- 
lated in the [M,z] space following scaling laws - this is the point 
where the mass accuracy enters; (5) as already specified, we encap- 
sulate all uncertainties on the scaling laws in the M-L relation for 
the cosmological modeling; (6) the set of cosmological parameters 
giving best agreement both on dn/dz and f, describes the most 
likely cosmological model. 

Practically, in the Fisher analysis, we assume that the slope and 
the dispersion of the M-L relation are known and do not depend 
on redshift. We let, however, the normalization of the relation free 
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Table 3. Planck survey parameters. 
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Figure 8. Priors for the Fisher analysis on the normalization of the M-L 
relation as a function of redshift, for the optimistic and pessimistic cases for 
the Survey-A configuration 



as a scale factor a{z). We take one scale factor for each redshift 
bin (A(z) = 0.1), hence we have 10 or 20 nuisance parameters 
depending on the survey depth. The priors for the analysis are de- 
rived from the accuracy assumed for the mass measurements of the 
individual clusters (Table [2}; they are displayed in Fig. [8] for the 
optimistic and pessimistic cases. 



5.4 Planck Fisher Matrix 

To estimate the full cosmological yield of an XXL-survey, we 
perform a joint analysis of the cluster survey with the primary 
CMB power spectra (temperature-TT, polarization-EE and cross- 
correlation TE) soon to be measured by the Planck satellite. 

A precise assessment of the Planck capabilities would require 
to model in detail the map making and component separation pro- 
cesses. To circumvent this problem, we make the simplifying as- 
sumption that the sky images in the three bands where the CMB 
emission dominates (100, 143 and 217 GHz) are readily usable to 
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measure the power spectra, while the other bands permit a perfect 
characterization of the other contaminating s ignals. 

Following JZaldarriaga & Seliakll2007h . the noise covariance 
matrix for each I (including the cosmic variance) is then given by: 



Cov(cr,cD = 



Cov(Cf B ,Cf E ) = 



cov(cT E ,cT E ) 



Cov(C EE ,CT E ) = 



Cm{CT T ,cT E ) = 



-iPi 



■NT 



{21 + l)/ sky 

- [Ci te 
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(Cr + N^ T ){ Cl EE +N^ E )], 

^ TE i EE 2 
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Clte, 



where 



N, 



i,x 



= I> 



-!(a+i)</>;/(8io g 2) 



(17) 



(18) 



is the contribution of the instrumental noise to the uncertainty on 
the spectrum X, which results from averaging over the different 
frequency channels c, with sensitivity a c ,x and angular bean-width 
(fi c . In Table [5] we quote the assumed experimental characteristics 
for the Planck satellite, which we obtained from the mission defi- 
nition document (the so-called 'Bluebook'f^ We adopt a fractional 
sky coverage of f s ^ y = 0.8 to account for the masking of the galac- 
tic plane. 

The full CMB Fisher matrix for a set of cosmological param- 
eters is straightforwardly obtained as: 

F°r =EE^ Cot_1 ( c ^ c ")^, d9) 

where X, Y = TT, EE, TE and we sum over I values in the range 
[1,2000]. 

In practice, we compute the power spectra using the CMB- 
FAST code and some care has to be taken in order to correctly ac- 
count for the intrinsic CMB degeneracies. Indeed, the shape of the 
matter power spectrum at the recombination epoch is only a func- 
tion of the primordial power spectrum and the physical densities 
(/3 m , pb, Pr) in the early universe. Further, while the relative ampli- 
tudes of the CMB peaks depend on the details of the matter/photon 



available from the ESA web pages of the Planck mission: 
http://www.rssd. esa.int/index.php?project=Planck 
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Table 4. Fisher matrix errors on the cosmological parameters from Planck. 
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densities, the physical scale of the baryon oscillation pattern is sim- 
ply proportional to the sound horizon at recombination (r s ). As a 
consequence, the CMB observables only depend on h, Que, wq 
and w a through the so-called CMB acoustic scale: 

Z a = 7r(l + 2dcc )^^ (20) 

where Zdcc is the redshift of decoupling and d a the angular diameter 
distance, [the factor (1 + Zdcc) comes from the fact that r s is mea- 
sured in the comoving frame]. This exact degeneracy of the CMB, 
known as the geometric degeneracy, prevents CMB experiments 
from giving any constraint on the dark energy without adding other 
observables. Numerical estimates of CMB Fisher matrices, based 
on codes such as CM BFAST, fail a t accur ately reproducing this de- 
generacy (see e.g. iKosowskv etafl < l2002l) ) and tend to give unreal- 
istic results solely because of numerical uncertainties. We therefore 
follow the approach of the DETF report and Rassat et al. (2009) to 
estimate the Fisher matrix over a 'natural' set of cosmological pa- 
rameters (Q m h 2 , Q^h 2 , l a , A|j, n s and r ). We then marginalize 
over t and convert the Fisher matrix into our preferred parameter 
set using the Jacobian matrix of the transformation. 
In Table 4 we quote the resulting constraints for Planck alone with 
or without the use of polarization. Because of the geometrical de- 
generacy, only constraints on the simplest ACMD models can be 
obtained, however we have also estimated the full Fisher Matrix for 
the w(z)CDM model, since it is necessary to derive the combined 
constraints from the Planck CMB spectra with the cluster observ- 
ables. This is then simply achieved by adding Eq. l |19l > to Eq. d!5l >. 



6 PREDICTED CONSTRAINTS ON THE 
COSMOLOGICAL PARAMETERS 

Results from the Fisher analysis for the equation of state of the 
dark energy are presented in Tables [5] and [6] for the A2 and B0 
survey configurations. We display the ultimate accuracy that can 
be reached for the most general, non flat, w(z)CDM cosmology. 
We outline below the main outcome of the study. 

(1) The comparison between the CI and C2 populations (limited 
to < z < 1) shows an improvement on wq, w a of about 20, 10% 
for the C2 sample. The C2 clusters are roughly twice as numerous 
as the CI, but less massive in average so that their impact on 
cosmological measurements is expected to be indeed relatively 
smaller. 

(2) Focussing on the Survey-A configuration, the C20 clusters are 
four times more numerous than the CI and some 250 of them are 
between 1 < z < 2. The net effect is an improvement better than a 
factor of two on w a and wq. 



(3) The comparison between the B and A survey designs for 
the C2 and C20 populations respectively shows comparable 
constraints when dn/dz, £ and Planck are combined (optimistic 
and pessimistic cases). However, the total number of clusters 
involved is 2320 for B compared to only 1400 for A. This stresses 
the efficiency of the 1 < z < 2 clusters f or characterising the dark 
energy (see also lBaldi & Pettorinol d2010l) ). 

(4) Table [7] lists the constraints expected after the first scan of 
survey A, thus at 1/4 of its nominal depth (C2 population only and 
measured in pessimistic conditions): the accuracy is about half of 
that at full depth, hence along the line of the signal oc V time 
ratio. 

We have further investigated the role of various hypotheses that 
were made in the prescription of the Fisher analysis. 

(5) This study is amongst the first ones to qualitatively consider the 
added value of the cluster spatial distribution in the determ i nation 
of th e DE parameters (see also iMaiumdar & Morirl 120041 : iHiitsil 
l20ld) . The impact of £ is highlighted in Fig. [9] It is remarkable 
given that the regions considered for Survey-A2 are only 3.5 deg 
aside, but should not be considered as unexpected. In fact, £ is 
particularly sensitive to fi m and as, thus it strongly contributes 
to breaking model parameter degeneracies. Furthermore the mass 
dependence of the halo clustering is opposite to that of the number 
counts. On the one hand, less massive halos are less clustered 
than the massive ones; on the other hand, the former are more 
numerous. Thus a combined measurement allows for a better mass 
determination of the cluster sample and directly improves the 
parameter inference. This is a clear advantage of dedicated cluster 
surveys over serendipitous searches. 

(6) Introducing a prior of 10% on the Hubble constant does not 
significantly improve w a , rao for the final dn/dz + £ + Planck 
settings but some 40, 20% better constraints are predicted when 
only dn/dz + Planck are considered. 

(7) We have examined the case where the M-L relation is perfectly 
known at all redshifts: we observe an improvement of less than 
15% both on wo and w a for the C20 population with the optimistic 
assumption. 

(8) We have further investigated what happens if the dispersion 
in the M-L relation (which can be interpreted as the dispersion 
in any mass-observable relation) is decreased from 0.6 to 0.1, 
re-computing the priors accordingly. In this case, the improvement 
is ~ 10%; assuming in addition that the M-L relation is perfectly 
known leads to a negligible improvement. 

(9) We have assumed that the cluster luminosities evolve self- 
similarly, which tend s to be supported by current observations 
( Maug han et al . 2008). Other scaling laws can be assumed like, for 
instance, no evolution, which implies that distant clusters are less 
luminous than in the self-similar hypothesis: this would decrease 
the number of detected high-z clusters. The impact of the cluster 
evolution hypothesis can be bracketed by the extreme case were no 
2 > 1 clusters are detected; in this case, the optimistic constraints 
on wo, w a would change from 0.40, 1.29 to 0.51, 1.67. 

(10) Finally, assuming a flat w(z)CDM cosmology improves the 
determination of wo and w a by about 5%. For a flat wCDM, 
we predict a precision of 0.040 for w with the C20 optimistic 
configuration (Survey-A) . 

A general summary of the expected dark energy parameter un- 
certainties from future cluster surveys has been presented in th e 
Dark Energy Task Force (DETF) document dATbrecht et al. f2 006). 
This review study classifies the projected performances of cluster 
surveys into stage II, III and IV. Stage II corresponds to surveys 
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Figure 9. Illustration of the impact of the cluster-cluster correlation function - A2 Survey configuration (from Table|5J. Left: \-a cosmological constraints 
from the cluster number counts only. Right Adding the correlation function. The black contours indicate the predictions by Dark Energy Task Force, stage IV . 



of 200 deg 2 with a mean mass threshold of 10 14 /i _1 M Q detect- 
ing approximately 4000 — 5000 clusters, and for which the ex- 
pected errors on the dark energy parameters are a WQ = 1.1 and 
<J Wa = 3.2. Stage III consists of surveys covering 4000 deg 2 with a 
mean threshold of 10 14 - 2 /i _1 M Q detecting ~ 30, 000 clusters. Fi- 
nally Stage IV corresponds to surveys covering 20, 000 deg 2 with 
a mass threshold of 10 14 - 4 /i _1 M Q and providing also 30, 000 clus- 
ters.The DETF predictions for stage III and IV are recalled in Table 
[8j they are comparable for both stages as each of them appear to be 
dominated by systematics. 

These projections have been derived under a number of assump- 
tions which differ from ours. First, the h alo mass function ha s been 
assumed in the fitting form provided by I Jenkins et all JiOQlh . Sec- 
ond, the settings of the Fisher analysis are also slightly different: 
while both studie s involve the same num ber of parameters, the anal- 
ysis presented bv lAlbrecht et al. IfeOOd) assumes a prior of ~ 10% 
on the Hubble constant - we do not (they also consider the Sq pa- 
rameter (k 3 P^/2Tv 2 ) in place of erg). Conversely, they use only 
number counts - we consider, in addition, the correlation function. 
The DETF adopts a constant mass selection, and masses are sup- 
posedly determined through "self-calibration", i.e. a functional de- 
pen dence between flux (or rich ness), mass and redshift is assumed 
(see lMaiumdar & Mohr 2004). The DETF has further assumed a 
root mean square error in the mean/variance of mass per redshift 
bin ranging from 2 — 14% for stage III and 1.6 — 11% for stage IV. 
Despite these differences, it is worth comparing the performances 
advocated by the DETF with our predictions. A quick glance at 
Tables [5] and [8] immediately reveals that the XXL pessimistic pre- 
dictions outperform the DETF pessimistic ones and that XXL op- 
timistic ones lay between the optimistic and pessimistic DETF cal- 
culations. This is a somewhat unexpected result given the ratio of 
the surveyed areas (a factor of 80-400) but is readily understand- 
able as the effect of the mass accuracy and of the presence of z > 1 
clusters, a direct consequence of the XMM deep exposures. We 
further compare the virtue of the XXL cluster population with the 
other cosmological probes examined by the DETF, namely : baryon 
acoustic oscillations, supernovae and weak lensing measurements. 
The comparisons are displayed in Fig.llQI 



7 DISCUSSION AND CONCLUSIONS 

We have shown through a Fisher matrix calculation that the XXL 
Survey-A (and B) can provide measurements of the cluster num- 
ber counts and 2-point correlation function of sufficient precision 
to provide useful constraints on the equation of state of the dark 
energy. In our analysis, special care has been devoted to the re- 
alistic modeling of the statistical uncertainties (sample variance 
and shot noise) due to the small size of the surveyed area (50 or 
200 deg 2 ) and of the cluster mass measurements. Our experience 
gained with XMM has allowed us to consider realistic cluster se- 
lection functions and to apply priors on individual cluster mass 
measurements. We have favored this approach against the use of 
'self-calibration' techniques, intended to by-pass the current igno- 
rance about the evolution of the cluster scaling law by simultane- 
ously fitting its functional form with cosmology. After all, self- 
cal ibration has not bee n observationally tested yet, and as shown 
by ISahlen et alj J2009I) . it is hampered by the fact that it intro- 
duces a latent degeneracy between the dispersion in the scaling 
laws and their redshift ev olution. Moreover, it has been pointed 
out bv lPacaud et al.1 d2007h that, in the case of X-ray flux measure- 
ments, emission lines produce discontinuities which cannot be sim- 
ply accounted for by the parametrized functional dependence. On 
the basis of these considerations we have deliberately assumed to 
individually measure cluster masses, and improve the X-ray mass 
derivation by means of S-Z and weak lensing observations. This is 
a reasonable working assumption given the relatively limited size 
of the surveyed area and the results will form the ideal basis for in- 
vestigating, a posteriori, self-calibration techniques. In the present 
analysis we have let the normalization of the scaling relation to be 
free for each Az = 0.1 redshift bins. Alternatively, taking larger 
bins (e.g. Az = 0.2) decreases the number of free parameters by a 
factor of two. This would allow the introduction of e.g. two more 
free parameters so as to enable the simultaneous fit of the evolution 
of the slope and of the dispersion of the relation. These hypotheses 
will be discussed in a subsequent article (paper II, Pacaud et al, in 
prep.). In this forthcoming work, we shall also compare the relative 
efficiency of various cluster selection functions (such as those pre- 
sented here and a fixed mass limit at any redshift), investigate the 
role of plausible evolution laws other than self-similarity, examine 
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Figure 10. Left: the 1-cr contours in the wa-w a plane for the various DETF probes at stage III (pessimistic assumptions) along with the XXL predictions 
for the CI and C20 populations (pessimistic case from Table[5). The contours for the DETF probes have been derived using the Fisher matrix data from the 
DETFast code, including the Planck priors in the same way as for XXL. Right, same as Left for the DETF stage IV and optimistic assumptions. 



the impact of the DE inhomogeneities on the halo mass function, 
discuss the added value of the evolution of £ and, especially, that of 
the cluster mass function (dn/dMdz) in constraining the DE equa- 
tion of state. 

In any case, our analysis demonstrates that a medium deep 50 deg 2 
survey with XMM - a modest project compared to the DETF stage 
IV requirements - is in a position to fulfill competitive expecta- 
tions in terms of cluster cosmological studies, while providing con- 
straints which are complementary to those expected from other 
probes. Moreover, from a practical point of view, compared to the 
cluster surveys advocated by the DETF (Stage III and IV) the XXL 
survey contains some 20 times less clusters, which makes the sam- 
ple much more tractable. 

We have shown that the Survey-A and Survey-B configu- 
rations provide equivalent constraints on the DE for a similar 
amount of XMM observing time (~ 20 Ms). Practically, we favor 
configuration A over B as, besides constraining the properties of 
dark energy, it is observationally more advantageous. There are 
also a number of compelling arguments as to the "legacy value" of 
Survey-A, which make it more appealing. Let us review them in 
some detail. 

- The aimed mass accuracy (to be complemented by a joint analysis 
of S-Z and weak lensing surveys), for all clusters entering the 
analysis, will have an invaluable scientific potential for the study 
of baryon physics. In particular, it will provide the long expected 
scaling law evolution out to a redshift of ~ 1.5 and to a mass 
M200 ~ 1O 14 M0. XMM pointed observations cannot achieve 
such an efficient determination for the simple reason that few X-ray 
clusters, and only massive ones, are known at z ~ 1. In contrast 
the Survey-A configuration has the ability to detect and reliably 
measure the signal from these objects in one single shot. This will 
provide very useful calibration data for other surveys (e.g. DES, 
eRosita), which are expected to cover much large r areas but at 
lower depth and poorer X-ray angular resolution iPredehl et al.1 
2006). Then the self-calibration method will be easily testable. 

- The spatial distribution of X-ray AGNs, which will constitute 



more than 90% of the sources of the planed survey, will be studied 
on very large scales as a function of their spectral properties. 

- For visibility reasons and observation programming, we favor the 
splitting of Survey-A in two or four sub-regions spread in right 
ascension. Furthermore, the XMM observations can be scheduled 
over four years, with each field being entirely covered by 10 ks 
XMM observations every year. The first year scan could already 
provide the full CI + C2 cluster catalogue, hence measurements 
of £ and dn/dz and constraints on the DE to an accuracy half 
of the final value. The three subsequent scans will then increase 
the number of X-ray photons down to the nominal 40 ks depth, 
thus providing the spectral accuracy and, finally, the cluster mass 
accuracy required for the full cosmological analysis. 

- Spreading the XMM observations over four years can provide 
unrivaled information about AGN variability over large timescales 
as a function of the spectral properties and environment. 

Finally, in addition to the important added value of £, we 
mention a number of arguments leading to favor contiguous 
surveys with respect to serendipitous cluster searches: 

- Operationally it is much more efficient to perform a joint X-ray + 
optical/lensing + S-Z survey than to undertake a pointed follow-up 
of X-ray clusters. And obviously, a joint optical survey renders the 
X-ray source identification straightforward. 

- Homogeneous wide surveys, compared to serendipitous searches, 
highly simplify the derivation of the selection functions which, as 
shown here, play a critical role for cosmological studies. 

- Using XMM archival data would only allow the determination 
of dn/dz and it is moreover important to note that the situation is 
different from that of the ROSAT serendipitous searches. ROSAT 
had a two-degree diameter field of view (against 30 arcmin for 
XMM) and a significant fraction of the known cluster population 
has been imaa ed by XMlvfl. This introduces complex biases that 

7 Over 1 000 cluster observations performed. Out of the some 1600 obser- 
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cannot be removed by simply discarding the central target or 
ignoring the target clusters as was routinely assumed in the past; it 
is especially serious at high redshift since only the X-ray brightest 
known clusters were considered as targets. 

- The proposed homogeneous survey will also enable the deter- 
mination of the structure of the X-ray background on very large 
scales at energies ranging from 0.1 to 10 keV. In addition, once 
the cluster population is detected and the redshifts measured, their 
3-D distribution will enable the identification of putative cosmic 
filaments. Staking the X-ray data corresponding to the location of 
many filaments then could lead to the first detectio n of the Warm 
Hot Intergalactic Medium in emission dSoltanll2008l) . 

One of the interesting outcomes of the present study is to have 
quantitatively estimated the impact of the cluster-cluster correla- 
tion function in dark energy studies. We leave to future studies the 
possibility of measuring the evolution of the cluster mass function 
dn/dMdz rather than dn/dz with the XXL survey, as well as the 
combination with the low-z REFLEX correlation function and the 
Planck cluster number counts + correlation function. In the future, 
one can also well imagine constraining cosmology directly by ap- 
plying the X-ray selection function on a large set of hydrodynam- 
ical simulations - when these become achievable - and match the 
properties of the resulting simulated cluster catalogues to that of 
the observed XXL one. Such methods, w hich are already applied 
on the Lya forest dViel & Haehneitll2006t) would allow one to to- 
tally by-pass the determination of the cluster mass-observable rela- 
tions as a function of redshift. 
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vations available with exposure time longer than 40ks (any type of targets), 
about 400 useful images remain when considering only high galactic lati- 
tude public observations and assuming a flaring rate of 25% (status of the 
XMM archive by September 2010) 
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Table 5. Cosmological constraints. Survey configuration A2 - 50 deg 2 full depth (40 ks XMM exposures) \-a errors on u>o / w, 



Pessimistic mass measurements Optimistic mass measurements 

Selection Redshift range dn/dz + Planck dn/dz + £ + Planck dn/dz + Planck dn/dz + £ + Planck 



CI 0<2<1 2.38/5.08 0.88/2.71 1.98/4.15 0.78/2.32 

C2 0<2<1 2.00/4.64 0.72/2.36 1.70/3.89 0.65/2.06 

C20 0<2<2 1.19/2.59 0.45/1.46 0.87/1.82 0.38/1.18 



Table 6. Cosmological constraints. Survey configuration B0 - 200 deg 2 full depth (10 ks XMM exposures) 1-tr errors on too / Wa 







Pessimistic mass measurements 


Optimistic mass measurements 


Selection 


Redshift range 


dn/dz + Planck dn/dz + £ + Planck 


dn/dz + Planck dn/dz + £ + Planck 


CI 


< z < 1 


1.58/3.30 0.54/ 1.71 


1.33/2.72 0.48/ 1.47 


C2 


< z < 1 


1.42/3.29 0.47/ 1.60 


1.13/2.52 0.40/1.29 



Table 7. Cosmological constraints. Survey configuration A2 - 50 deg 2 1/4 depth (10 ks XMM exposures) l-a errors on wq I w a 



Selection 


Redshift range 


dn/dz + Planck 


dn/dz + £ + Planck 


CI (pessimistic) 


< 2 < 1 


2.77/5.98 


0.97/3.08 


C2 (optimistic) 


< z < 2 


1.14/2.44 


0.55 / 1.70 



Table 8. Cosmological constraints from clusters following the DETF survey designs l-a errors on wo I w a 



Stage Pessimistic Optimistic 



HI 0.70/2.11 0.26/0.77 
IV 0.73/2.18 0.24/0.73 
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APPENDIX A: ADOPTED HALO MASS FUNCTION 

Early mode ling of the mass function relied on sem i-analytical 
approaches dPress &Schechterlll974l :l Bond et al however 
comparison with N-body simulations showed discrepancies with 
the numerically estima ted function, and a simu lation calibrated for- 
mula was proposed by Shet h & Tormenl dl999h . Over the years the 
increasing resolution of numerical simulations has led to more ac- 
curate estimations of the halo mass fun ction, and the stand ard of 
accuracy has been set by the analysis of ljenkinse t al. (200l]). The 
authors of this study have provided a 'universal' (hence applicable 
to different cosmologies and at different redshifts) fitting formula 
that is accurate to within 20%. Recent studies have cast doubts 
on the universality of the mass function. In particular the analy- 
sis bv lTinkeretal] j2008h has shown important deviations in the 
high mass end and at high redshift. Nonetheless these authors have 
been able to provide a fitting formula accurate to < 5% at z — 
and to < 20% at z = 1.25, while degrading to 50% only at 
2 = 2.5. In our analysis we assume their fitting halo mass func- 
tion, parametrized in terms of the halo mass enclosed in a radius 
containing 200 times the critical density of matter, M2000 with 
the following functional form, 



Table Bl. Surveys extracted from the Pinocchio simulations. Fields AO, Al, 
A2 pertains to different configurations of Survey-A totalling 50 deg 2 . Field 
Zl covers 10 000 deg 2 and is used for statistical comparison. 



A 



+ 1 



-c/a 2 



(Al) 



where A = A (l + 2)" ' 14 , a = o (l + z)-°- 06 , b = b (l+z)- a 
a ndloKina = -f0. 75/loK ln (2.67/U^z))] 1 - 2 , (see Eqs. (3)-(8) 
in Tin ker et al . 2008). In Table 2 of the same paper, values of the pa- 
rameters Ao, ao, bo and c are provided for several density contrasts 
A m , defined with respect to the mean matter density. Following the 
guidelines of their Appendix B, we perform spline interpolation be- 
tween the individual parameter values to match our mass overden- 
sity convention A m = 200/f2 m (z) at any given z. This ensures 
that the mass definition of our cosmological modeling matches the 
convention used for cluster scaling relations and thus for our selec- 
tion function. 

It has recently been pointed out that DE leaves characteristic im- 
prints on the non-linear phase of collapse of halos. These imprints 
manifest in the non-linear power spectrum as well as in the halo 
mass functi on and may yield u p to 20% deviations from ACDM 
predictions dCourtin et al]201Ch . In paper II (Pacaud et al, in prep.) 
we shall investigate how this would impact the predicted DE con- 
straints. 



APPENDIX B: EVALUATING THE SIGNIFICANCE OF 
dn/dz AND £ FOR VARIOUS SURVEY CONFIGURATIONS 

We us e the publicly availab le Pinocchio package dMonaco et al. I 
2002a b; Taffoni et alj|2002h to generate 3D cluster catalogues for 
a given initial density field realisation and cosmology. We use the 
2.2-beta version, that is now entirely paralleled and available from 
the authors on demand. Pinocchio, while following the procedure 
of N-body simulations, works in the Zel'dovich approximation, 
allowing for faster computation by several orders of magnitude 
with respect to equivalent N-body simulations (in terms of mass 
resolution and volume probed). Confronting the Pinocchio re- 
alisations with the h igh resolution full-sky Horizon simulations 
dTevssier et~ai]|2009l) (in the case of a ACDM model best-fit to 
WMAP-3 years data) we have checked that the Pinocchio cluster 
mass function is accurate to 10%, and that the 2-point correlation 
function can be reliably estimated down to 10/i _1 Mpc scale. We 
observe however, a slight increase of £ around this scale, as the 



survey configurations 


AO 


Al 


A2 


Zl 


Total surveyed area (deg 2 ) 


50 


50 


50 


10 000 


number of sub-fields 


1 


2 


4 


4 


sub-field side (in deg) 


7.07 


5 


3.54 


50 


number of independent 










simulated sub-fields 


190 


215 


230 


30 



unresolved clusters tend to accumulate at this point. We illustrate 
below our procedure considering the C2 selection for various 
configurations totalling the 50 deg 2 of Survey-A. 

Using Pinocchio we generate 5 cosmic volumes with dif- 
ferent random initial c onditions for a ACD M model best-fitting 
WMAP-5 years data dDunklev et all [2009). Each volume is a 
box of 3500 x 3500 x 3500 comobile Mpc 3 observed from the 
corners, providing 8 past-lightcone octants. These octants are 
combined, using the periodicity of the volumes, to finally provide 5 
full-sky past-lightcones independant from each other. The physical 
position of each simulated halo is corrected for its peculiar velocity 
since the correlation function is computed in redshift space. In 
order to estimate the halo 2-point correlation function 5 bootstrap 
full-sky lightcones are generated from the data. The angular 
position of each halo is randomized 10 times to artificially create 
lightcones containing 10 times more halos than the original data. 
The redshifts of the original Pinocchio simulated data as well as 
the mass probability distribution function are conserved in these 
"random" lightcones. 

From these lightcones we extract a large number of XXL survey 
realisations. We considered several survey configurations: a single 
7.07 x 7.07 deg 2 field, and configurations consisting of two 
5x5 deg 2 and four 3.54 x 3.54 deg 2 patches respectively. The 
last two configurations are more likely to correspond to actual 
observations since spreading patches in right ascension ensures a 
more efficient observation scheduling. Also splitting the survey 
into several sub-fields is usually expected to decrease the impact 
of the sample variance; an effect that we quantitatively estimate 
hereafter. In order to avoid large-scale correlations, edges of the 
extracted sub-fields are separated by at least 30 deg in RA and Dec. 
We also extracted survey fields covering 50x50 deg 2 for statistical 
comparison. The characteristics of the different survey realisations 
are given in Table IB ll 

In each simulated sub-field, we compute dn/dz and £, for a 
given selection. The 2-poi nt correlation fuction is measured using 
the estimator introduced bv lLandv&Szalavldl993l) . The results are 
then combined according to each of the survey configurations il- 
lustrated in Table IB ll e.g. for the A2 design, individual dn/dz are 
summed over the 4 patches, while individual £ are averaged over 
the ensemble. Then for each configuration, the resulting quantities 
are averaged over all realisations. The la errors about the average 
dn/dz and £ are computed as a function of z and R for each of the 
4 survey configurations, including the signal-to-noise ratio. As we 
describe in Section [5] we use the estimated values of S/N to deter- 
mine the experimental uncertainties necessary for the Fisher matrix 
analysis. Results are summarized in Figs. IB H and IB2I As it can be 
appreciated from Fig. IB II the cluster number counts turn out to be 
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Figure Bl. Signal-to-noise ratio for dn/dz as a function of redshift, ob- 
tained for different realisations of Survey-A and the C2 selection using 
Pinocchio simulations; bin size Az = 0.1 

insensitive to the sub-field splitting of the survey design, i.e. a sin- 
gle 7.07 x 7.07 deg 2 field (Al), two 5 x 5 deg 2 sub-fields (Al) 
or four 3.54 x 3.54 deg 2 sub-fields (A2). The 2-point correlation 
function appears to be slightly dependent on the size of the sub- 
fields but the impact on the S/N is negligible. The 50 x 50 deg 2 Zl 
reference realization indicates that it is possible to reliably compute 
£ at least out to 40 Mpc/h for the AO, Al or A2 configurations. We 
note that 40 h~ Mpc is slightly smaller than the comoving length 
encompassed by the A2 realisation at the survey maximum sen- 
sitivity (3.54 deg at z = 0.3 corresponds to 53 h~ 1 Mpc scale). 
We sample £ with a scale separation > 10/i _1 Mpc because of the 
limited resolution of the Pinocchio simulations. Since cluster virial 
radii are on the order of lh^ 1 Mpc, this implies that we may be 
loosing some power on scales of ~ 5 — lO/i -1 Mpc, where merg- 
ers are expected to occur. 

The Pinocchio experiment indicates the AO, Al and A2 configura- 
tions are equivalent in terms of S/N both for dn/dz and £. In the 
paper we consider the A2 configuration, which is for observational 
reason the easiest to perform, when presenting the results of the 
cosmological analysis. 




10 15 20 25 30 35 40 

distance R[Mpc/h] 



Figure B2. Top: 3D averaged 2-point correlation function for different 
Survey-A realisations extracted from the Pinocchio simulations using the 
C2 selection function. The bin size is dlogi? = 0.1. Bottom: Correspond- 
ing signal-to-noise ratio. 



